What is claimed is: 

1. A system for\ pan-genomic determination of three- 
dimensional macromolecular atomic structures, comprising: 

database means f©r systematically organizing all known 
structural information Vinto a genomics database of structural 
information, sequence information and functional information; 

bioinf ormatics means for using said structural 
information, sequence information and functional information 
stored in said database \to cluster a plurality of known gene 
products into a plurality of families of homologous sequences; 

protein synthesis mekns for synthesizing for each family, 
in parallel simultaneously , a plurality of member proteins 
using genomics information of a plurality of appropriately 
representative species; \ 

screening means for screening the synthesized proteins 
to determine ones that are\ effective ; 

protein processing means for preparing, purifying and 
characterizing the screened proteins determined to be 
effective by said screening Weans ; 

crystallization means nor crystallizing a plurality of 
the purified proteins in parallel against a plurality of 
crystallization screens, and\ testing a plurality of grown 
crystals for predetermined diffraction characteristics to 
determine suitable ones of said plurality of grown crystals; 

X-ray crystallography means for performing high- 
throughput crystallography, saici X-ray crystallography means 
having diffraction measuring mesons for measuring a suitable 
crystal for diffraction data, analyzing means for analyzing 
said diffraction data, means for building an atomic model, and 
means for refining said model against said diffraction data 
and storing the refined model in said database; 

structure extraction means having means for analyzing the 
refined model using sequence information of other family 
members and information of other known three-dimensional 
structures, means for analyzing for functional motifs and for 
surface characteristics to definA active sites and 
macromolecular contact sites, and means \f or defining at least 



one class of compound^ predicted to have binding potency using 
the active sites information; and 

homology model building means for developing a homology 
model using an atomic model retrieved from said database, 
5 wherein an ensemble of all known structures is used to 

further advance an effectiveness of said bioinf ormatics means. 

2. A system according Ao claim 1, further comprising: 

cryoprotection means for freezing said suitable ones of 
10 said plurality of crystals which are supplied by said 
crystallization means, \ 

wherein said suitable crystal is frozen by said 
cryoprotection means before being measured for said 
diffraction data by said diffraction measuring means. 
15 \ 

3. A system according to claim 1, wherein 

said protein synthesis means includes cloning means for 
cloning for said each family, in parallel simultaneously, 
cDNAs of said plurality ofi appropriately representative 
2 0 species into a plurality of expression vectors for a plurality 
of expressions systems, \ 

said screening means screens for expression constructs 
obtained from said cloning means to determine ones that are 
effective, and \ 

2 5 said protein processing meajis processes the expressed 

proteins determined to be effectiv^e by said screening means. 

4. A system according to claim 1, wherein 

said X-ray crystallography means includes a synchrotron 

3 0 storage ring having undulator beamlines for high- throughput 

crystallography by a multiwavelength\ anomalous diffraction 
method, and \ 

said analyzing means analyzes said diffraction data by 
a multi wavelength anomalous diffraction! phasing method. 
35 \ 

5. A system according to claim 4, wherein selenomethionine 
is incorporated in the synthesized proteins by said protein 
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synthesis means, and said analyzing means using said 
multiwavelength anomalous diffraction phasing method analyzes 
said diffraction datalof selenomethionyl proteins. 

5 6. A system according co claim 1, wherein said homology model 
is used in at least one! of target selection, drug design, and 
design of more appropriate constructs for experimental 
analysis. \ 

10 7. A process for pari-genomic determination of three- 
dimensional macromolecular atomic structures, comprising the 
steps of: \ 

systematically organizing all known structural 
information, sequence infoAnation and functional information 
15 into a genomics database; \ 

clustering a plurality\of known gene products into a 
plurality of families of homologous sequences using at least 
one bioinf ormatics tools; \ 

synthesizing for each famivLy, in parallel simultaneously, 

2 0 a plurality of member proteins Vising genomics information of 

a plurality of appropriately representative species; 

screening the synthesized prbteins to determine ones that 
are effective; \ 

preparing, purifying and characterizing the screened 
25 proteins determined to be effective \in said step of screening; 

crystallizing a plurality o^f purified proteins in 
parallel against a plurality of crystallization screens; 

testing a plurality of grown crystals for predetermined 
diffraction characteristics to determine suitable ones of said 

3 0 plurality of grown crystals; \ 

performing high- throughput crystallography, including 
measuring a suitable crystal for diffraction data, build an 
atomic model according to an analysis of said diffraction 
data, refining said model against said diffraction data, and 
35 storing the refined model in said database): 

analyzing the refined model using sequence information 
from other family members and inf ormation\ of other known 
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three- dimensional structures, analyzing for functional motifs 
and for surface cliairacteristics to define active sites and 
macromolecular contact sites, and defining at least one class 
of compounds predicted to have binding potency using the 
5 active sites information; and 

developing a homology model using computational tools for 
homology model buildincrt and an atomic model retrieved from 
said database, 1 

wherein said database is updated by using said 
10 bioinf ormatics tools almng with said all known structural 
information and an ensemble of all known structures is used 
to further advance an effectiveness of said bioinf ormatics 
tools. \ 

15 8. A process according to claim 7, further comprising the 
step of : \ 

freezing selected ones of said plurality of grown 
crystals determined to be suitable in said step of testing, 
wherein said suitable arystal is frozen before being 

2 0 measured for said diffraction diata in said step of measuring. 

9. A process according to claim 7, wherein 

said step of synthesis includes the step of cloning for 
said each family, in parallel simultaneously, cDNAs from said 
25 plurality of appropriately representative species into a 
plurality of expression vectors far a plurality of expressions 
systems, \ 

constructs obtained in said step of cloning are screened 
for expression to determine ones that are effective, and 

3 0 the expressed proteins determined to be effective are 

processed in said step of preparing, purifying and 
characterizing. \ 

10. A process according to claim 7, wherein 

35 said high- throughput crystallography is performed using 

a synchrotron storage ring having unduUator beamlines along 
with a multi wave length anomalous diffraction method, and 



* • 

-31- 

said dif f ractiorAdata is analyzed using a multi wave length 
anomalous dif fraction \phasing method. 

11. A process according to claim 10, wherein selenomethionine 
5 is incorporated in the synthesized proteins, and said 

multiwavelength anomalous diffraction phasing method is used 
to analyze said diffraction data of selenomethionyl proteins. 

12. A process according to claim 7, further comprising the 
10 step of \ 

using said homology model in at least one of target 
selection, drug design, and design of more appropriate 
constructs for experimental analysis. 



